Summary in context: Searching versus browsing
Daniel M. McDonald, Hsinchun Chen 

January 2006 ACM Transactions on Information Systems (TOIS), volume 24 issue 1
Publisher: ACM Press

Full text available: pdf(530.99 KB) Additional Information: full citation, abstract, references, index terms

The use of text summaries in information-seeking research has focused on query-based 
summaries. Extracting content that resembles the query alone, however, ignores the 
greater context of the document. Such context may be central to the purpose and 
meaning of the document. We developed a generic, a query-based, and a hybrid 
summarizer, each with differing amounts of document context. The generic summarizer 
used a blend of discourse information and information obtained through traditional 
surface- ... 

Keywords: Summarization, browse, generic summaries, indicative summaries, 
information seeking, natural language processing, search, text processing 



2 Summari ...

Jian-Tao Sun, Dou Shen, Hua-Jun Zeng, Qiang Yang, Yuchang Lu, Zheng Chen 
August 2005 Proceedings of the 28th annual international ACM SIGIR conference on 
Research and development in information retrieval SIGIR '05 

Publisher: ACM Press 

Full text available: pdf Additional Information: full citation, abstract, references, citings, index terms

Most previous Web-page summarization methods treat a Web page as plain text. However, 
such methods fail to uncover the full knowledge associated with a Web page needed in 
building a high-quality summary, because many of these methods do not consider the 
hidden relationships in the Web. Uncovering the hidden knowledge is important in building 
good Web-page summarizers. In this paper, we extract the extra knowledge from the 
clickthrough data of a Web search engine to improve Web-page summarization ... 

Keywords: clickthrough data, generic web-page summarization, latent semantic analysis, 
thematic lexicon 
Links for a better web: Enhanced web document summarization using hyperlinks

J.-Y. Delort, B. Bouchon-Meunier, M. Rifqi 

August 2003 Proceedings of the fourteenth ACM conference on Hypertext and 
hypermedia HYPERTEXT '03 

Publisher: ACM Press 

Full text available: pdf(167.83 KB) Additional Information: full citation, abstract, references, citings, index terms

This paper addresses the issue of Web document summarization. As textual content of 
Web documents is often scarce or irrelevant and existing summarization techniques are 
based on it, many Web pages and websites cannot be suitably summarized. We consider 
the context of a Web document by the textual content of all the documents linking to it. To 
summarize a target Web document, a context-based summarizer has to perform a 
preprocessing task, during which it will be decided which pieces of informati ... 

Keywords: context, hyperlinks, summarization, web document 



Autonomous authoring tools for hypertext 
Mark Truran, James Goulding, Helen Ashman

September 2007 ACM Computing Surveys (CSUR), volume 39 issue 3 
Publisher: ACM Press 

Full text available: pdf(667.35 KB) Additional Information: full citation, abstract, references, index terms

Autonomous authoring tools are routinely used to expedite the translation of large 
document collections into functioning hypertexts. They are also used to add hyperlinks to 
pre-existing hypertext structures. In this survey we describe a taxonomy of autonomous 
hypertext authoring tools. The classification of any given system is determined by the type 
and nature of the document analysis it performs. 
Keywords: Authoring tools, automatic hypertext generation systems, link generation



Social software: From social bookmarking to social summarization: an experiment in
community-based summary generation
Oisin Boydell, Barry Smyth

January 2007 Proceedings of the 12th international conference on Intelligent user
interfaces IUI '07
Publisher: ACM Press

Full text available: pdf(741.75 KB) Additional Information: full citation, abstract, references, index terms

We describe a novel document summarization technique that uses informational cues, 
such as social bookmarks or search queries, as the basis for summary construction by 
leveraging the snippet-generation capabilities of standard search engines. A 
comprehensive evaluation demonstrates how the social summarization technique can 
generate summaries that are of significantly higher quality than those produced by a
number of leading alternatives. 

Keywords: click-through data, community, social bookmarks, summarization, web search 
Krisztian Balog, Leif Azzopardi, Maarten de Rijke

August 2006 Proceedings of the 29th annual international ACM SIGIR conference on
Research and development in information retrieval SIGIR '06
Publisher: ACM Press

Full text available: pdf(232.79 KB) Additional Information: full citation, abstract, references, index terms

Searching an organization's document repositories for experts provides a cost effective 
solution for the task of expert finding. We present two general strategies to expert 
searching given a document collection which are formalized using generative probabilistic 
models. The first of these directly models an expert's knowledge based on the documents 
that they are associated with, whilst the second locates documents on topic, and then 
finds the associated expert. Forming reliable associations is c ... 

Keywords: enterprise search, expert finding 
1 Retrieving ...

Wen-Syan Li, K. Selçuk Candan, Quoc Vu, Divyakant Agrawal



April 2001 Proceedings of the 10th international conference on World Wide Web 
WWW '01 

Publisher: ACM Press 

Full text available: pdf(1.97 MB) Additional Information: full citation, references, citings, index terms



Keywords: link structures, progressive processing, query relaxation, web proximity 
search 



2 Appli ... graph propagation

Jingjing Liu, Wei Lai, Xian-Sheng Hua, Yalou Huang, Shipeng Li

September 2007 Proceedings of the 15th international conference on Multimedia
MULTIMEDIA '07
Publisher: ACM Press

Full text available: pdf(406.64 KB) Additional Information: full citation, abstract, references, index terms

This paper is concerned with the problem of multimodal fusion in video search. First, we
employ an object-sensitive approach to query analysis to improve the baseline result of 
text-based video search. Then, we propose a PageRank-like graph-based approach to text- 
based search result re-ranking. To better exploit the underlying relationship between video 
shots, the proposed re-ranking scheme simultaneously leverages textual relevancy, 
semantic concept relevancy, and I ... 

Keywords: multi-graph propagation, multimodal fusion, object-sensitive, pagerank 
algorithm, query analysis, re-ranking, video search 



3 Industrial and practical experience track paper session 2: The infocious web search
engine: ...
Alexandros Ntoulas, Gerald Chao, Junghoo Cho

May 2005 Special interest tracks and posters of the 14th international conference on
World Wide Web WWW '05

Publisher: ACM Press

Full text available: pdf Additional Information: full citation, abstract, references, index terms

In this paper we present the Infocious Web search engine [23]. Our goal in creating
Infocious is to improve the way people find information on the Web by resolving 
ambiguities present in natural language text. This is achieved by performing linguistic 
analysis on the content of the Web pages we index, which is a departure from existing 
Web search engines that return results mainly based on keyword matching. This additional 
step of linguistic processing gives Infocious two main advantages. First, ...

Keywords: concept extraction, crawling, indexing, information retrieval, language 
analysis, linguistic analysis of web text, natural language processing, part-of-speech 
tagging, phrase identification, web search engine, web searching, word sense 
disambiguation 



ScentTrails: Integrating browsing and searching on the Web
Christopher Olston, Ed H. Chi 

September 2003 ACM Transactions on Computer-Human Interaction (TOCHI), Volume 10
Issue 3

Publisher: ACM Press

Full text available: pdf(654.98 KB) Additional Information: full citation, abstract, references, citings, index
terms, review

The two predominant paradigms for finding information on the Web are browsing and 
keyword searching. While they exhibit complementary advantages, neither paradigm alone 
is adequate for complex information goals that lend themselves partially to browsing and 
partially to searching. To integrate browsing and searching smoothly into a single 
interface, we introduce a novel approach called ScentTrails. Based on the concept of
information scent developed in the context of information foraging theory, ... 

Keywords: ScentTrails, World Wide Web, browsing, information scent, searching



5 Web resource crawling and searching: Pushing task relevant web links down to the
desktop

Paul-Alexandru Chirita, Claudiu S. Firan, Wolfgang Nejdl

November 2006 Proceedings of the eighth ACM international workshop on Web
information and data management WIDM '06
Publisher: ACM Press

Full text available: pdf(219.36 KB) Additional Information: full citation, abstract, references, cited by, index
terms

Searching the web has become a task in many people's work, without which subsequent 
tasks would be hard to carry out or even impossible. But as people tend to have less time 
for querying the web or even for searching their personal computer for information they 
need, it becomes common to skip information gathering activities like trying to find useful 
resources on the web because of the "effort" it takes to query a web search engine. In this 
paper we propose to use software agents that collect u ... 

Keywords: document summarization, just-in-time information retrieval, personalized web 
search, user profile 



Machine learning in automated text categorization 
Fabrizio Sebastiani

March 2002 ACM Computing Surveys (CSUR), volume 34 issue 1
Publisher: ACM Press

Full text available: pdf(524.41 KB) Additional Information: full citation, abstract, references, citings, index terms

The automated categorization (or classification) of texts into predefined categories has 
witnessed a booming interest in the last 10 years, due to the increased availability of 
documents in digital form and the ensuing need to organize them. In the research 
community the dominant approach to this problem is based on machine learning 
techniques: a general inductive process automatically builds a classifier by learning, from 
a set of preclassified documents, the characteristics of the categories. ... 

Keywords: Machine learning, text categorization, text classification 



Links for a better web: Enhanced web document summarization using hyperlinks 
J.-Y. Delort, B. Bouchon-Meunier, M. Rifqi 

August 2003 Proceedings of the fourteenth ACM conference on Hypertext and 
hypermedia HYPERTEXT '03 

Publisher: ACM Press 

Full text available: pdf(167.83 KB) Additional Information: full citation, abstract, references, citings, index
terms

This paper addresses the issue of Web document summarization. As textual content of 
Web documents is often scarce or irrelevant and existing summarization techniques are 
based on it, many Web pages and websites cannot be suitably summarized. We consider 
the context of a Web document by the textual content of all the documents linking to it. To 
summarize a target Web document, a context-based summarizer has to perform a 
preprocessing task, during which it will be decided which pieces of informati ... 

Keywords: context, hyperlinks, summarization, web document 



Machine learning for IR: Learning effective ranking functions for newsgroup search 
Wensi Xi, Jesper Lind, Eric Brill

July 2004 Proceedings of the 27th annual international ACM SIGIR conference on 
Research and development in information retrieval SIGIR '04 

Publisher: ACM Press 

Full text available: pdf(281.11 KB) Additional Information: full citation, abstract, references, citings, index
terms

Web communities are web virtual broadcasting spaces where people can freely discuss 
anything. While such communities function as discussion boards, they have even greater 
value as large repositories of archived information. In order to unlock the value of this 
resource, we need an effective means for searching archived discussion threads. 
Unfortunately the techniques that have proven successful for searching document 
collections and the Web are not ideally suited to the task of searching archive ... 

Keywords: information retrieval, linear regression, machine learning, newsgroup search, 
support vector machines 



9 Query result processing: Mining anchor text for query refinement 

Reiner Kraft, Jason Zien 
May 2004 Proceedings of the 13th international conference on World Wide Web
WWW '04

Publisher: ACM Press

Full text available: pdf(100.27 KB) Additional Information: full citation, abstract, references, citings, index
terms

When searching large hypertext document collections, it is often possible that there are
too many results available for ambiguous queries. Query refinement is an interactive 
process of query modification that can be used to narrow down the scope of search 
results. We propose a new method for automatically generating refinements or related 
terms to queries by mining anchor text for a large hypertext document collection. We 
show that the usage of anchor text as a basis for query refinement produce ... 

Keywords: anchor text, query refinement, rank, web search 



10 Web mining research: a survey 

Raymond Kosala, Hendrik Blockeel 
June 2000 ACM SIGKDD Explorations Newsletter, Volume 2 issue 1

Publisher: ACM Press

Full text available: pdf(1.58 MB) Additional Information: full citation, citings, index terms



Keywords: Web, data mining, information extraction, information retrieval 



11 IR-6 (information retrieval): digital libraries: SERF: integrating human
recommendations ...

Seikyung Jung, Kevin Harris, Janet Webster, Jonathan L. Herlocker

November 2004 Proceedings of the thirteenth ACM international conference on
Information and knowledge management CIKM '04
Publisher: ACM Press

Full text available: pdf Additional Information: full citation, abstract, references, index terms

Today's university library has many digitally accessible resources, both indexes to content 
and considerable original content. Using off-the-shelf search technology provides a single 
point of access into library resources, but we have found that such full-text indexing 
technology is not entirely satisfactory for library searching. 

In response to this, we report initial usage results from a prototype of an entirely new type 
of search engine - The System for Electronic Recommendation Fi ... 

Keywords: collaborative filtering, digital libraries, information retrieval, user studies, web 
search 



Discovering missing ...

Sisay Fissaha Adafre, Maarten de Rijke 

August 2005 Proceedings of the 3rd international workshop on Link discovery 
LinkKDD '05 

Publisher: ACM Press 

Full text available: pdf(183.94 KB) Additional Information: full citation, abstract, references, index terms

In this paper we address the problem of discovering missing hypertext links in Wikipedia. 
The method we propose consists of two steps: first, we compute a cluster of highly similar 
pages around a given page, and then we identify candidate links from those similar pages 
that might be missing on the given page. The main innovation is in the algorithm that we 
use for identifying similar pages, LTRank, which ranks pages using co-citation and page 
title information. Both LTRank and the link discovery ...

Keywords: co-citation, link analysis, wikipedia 
Industrial and practical ... pages

Baoning Wu, Brian D. Davison

May 2005 Special interest tracks and posters of the 14th international conference on
World Wide Web WWW '05

Publisher: ACM Press

Full text available: pdf Additional Information: full citation, abstract, references, citings, index
terms

With the increasing importance of search in guiding today's web traffic, more and more 
effort has been spent to create search engine spam. Since link analysis is one of the most 
important factors in current commercial search engines' ranking systems, new kinds of 
spam aiming at links have appeared. Building link farms is one technique that can 
deteriorate link-based ranking algorithms. In this paper, we present algorithms for 
detecting these link farms automatically by first generating a seed se ... 

Keywords: HITS, PageRank, link analysis, spam, web search engine 



Evolving intelligent ...
Edmund S. Yu, Ping C. Koo, Elizabeth D. Liddy
June 2000 Proceedings of the fourth international conference on Autonomous agents
AGENTS '00
Publisher: ACM Press

Full text available: pdf Additional Information: full citation, references, citings, index terms



Keywords: evolution of agents, information agents, learning and adaptation, multi-agent 
teams 



15 SALSA: the stochastic approach for link-structure analysis 
R. Lempel, S. Moran

April 2001 ACM Transactions on Information Systems (TOIS), volume 19 issue 2
Publisher: ACM Press

Full text available: pdf(180.81 KB) Additional Information: full citation, abstract, references, citings, index
terms

Today, when searching for information on the WWW, one usually performs a query 
through a term-based search engine. These engines return, as the query's result, a list of 
Web pages whose contents matches the query. For broad-topic queries, such searches 
often result in a huge set of retrieved documents, many of which are irrelevant to the 
user. However, much information is contained in the link-structure of the WWW. 
Information such as which pages are linked to others can be used to augment searc ... 

Keywords: Link-structure analysis, SALSA, TKC effect, hubs and authorities, random 
walks 



16 Improved search ranking: Beyond PageRank: machine learning for static ranking
Matthew Richardson, Amit Prakash, Eric Brill

May 2006 Proceedings of the 15th international conference on World Wide Web
WWW '06

Publisher: ACM Press

Full text available: pdf(133.74 KB) Additional Information: full citation, abstract, references, index terms

Since the publication of Brin and Page's paper on PageRank, many in the Web community 
have depended on PageRank for the static (query-independent) ordering of Web pages. 
We show that we can significantly outperform PageRank using features that are 
independent of the link structure of the Web. We gain a further boost in accuracy by using 
data on the frequency at which users visit Web pages. We use RankNet, a ranking 
machine learning algorithm, to combine these and other static features based on a ... 

Keywords: PageRank, RankNet, relevance, search engines, static ranking 



17 Relevant ... search

Sergei Vassilvitskii, Eric Brill

August 2006 Proceedings of the 29th annual international ACM SIGIR conference on
Research and development in information retrieval SIGIR '06
Publisher: ACM Press

Full text available: pdf(885.86 KB) Additional Information: full citation, abstract, references, index terms

We study the effect of user supplied relevance feedback in improving web search results. 
Rather than using query refinement or document similarity measures to rerank results, we 
show that the web-graph distance between two documents is a robust measure of their 
relative relevancy. We demonstrate how the use of this metric can improve the rankings of 
result URLs, even when the user only rates one document in the dataset. Our research 
suggests that such interactive systems can significantly improv ... 

Keywords: link analysis, relevance feedback, web search 



18 Novel search ... search tool: a case study in the nanotechnology domain

Michael Chau, Hsinchun Chen, Jialun Qin, Yilu Zhou, Yi Qin, Wai-Ki Sung, Daniel McDonald
July 2002 Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
JCDL '02
Publisher: ACM Press

Full text available: pdf(859.29 KB) Additional Information: full citation, abstract, references, citings, index
terms

As the Web has been growing exponentially, it has become increasingly difficult to search 
for desired information. In recent years, many domain-specific (vertical) search tools have 
been developed to serve the information needs of specific fields. This paper describes two 
approaches to building a domain-specific search tool. We report our experience in building 
two different tools in the nanotechnology domain -- (1) a server-side search engine, and 
(2) a client-side search agent. The designs of ... 

Keywords: indexing, information retrieval, internet searching and browsing, internet 
spider, noun-phrasing, personalization, post-retrieval analysis, self-organizing map, 
summarization, vertical search engine, web search engine 



Web graphs: Web projections: learning from contextual subgraphs of the web
Jure Leskovec, Susan Dumais, Eric Horvitz 

May 2007 Proceedings of the 16th international conference on World Wide Web 
WWW '07 

Publisher: ACM Press 

Full text available: pdf(313.24 KB) Additional Information: full citation, abstract, references, index terms

Graphical relationships among Web pages have been exploited in methods for ranking
search results. To date, specific graphical properties have been used in these analyses. We
introduce a Web Projection methodology that generalizes prior efforts of
graphical relationships of the web in several ways. With the approach, we create subgraphs
by projecting sets of pages and domains onto the larger web graph, and then use machine
learning to construct predictive models that consider graphical properties as ...

Keywords: contextual subgraph, query reformulation, web graph, web projection, web 
search 



20 information access and retrieval (IAR): Template detection for large scale search
engines

Liang Chen, Shaozhi Ye, Xing Li

April 2006 Proceedings of the 2006 ACM symposium on Applied computing SAC '06
Publisher: ACM Press

Full text available: pdf Additional Information: full citation, abstract, references, cited by, index
terms

Templates in web sites hurt search engine retrieval performance, especially in content 
relevance and link analysis. Current template removal methods suffer from processing 
speed and scalability when dealing with large volume web pages. In this paper, we 
propose a novel two-stage template detection method, which combines template detection 
and removal with the index building process of a search engine. First, web pages are 
segmented into blocks and blocks are clustered according to their style fe ... 

Keywords: clustering, template detection, web page segmentation 
21 Multimedia and visualization: Dynamic structuring of web information for access
visualization

Jess Y. S. Mak, Hong Va Leong, Alvin T. S. Chan

March 2002 Proceedings of the 2002 ACM symposium on Applied computing SAC '02
Publisher: ACM Press

Full text available: pdf(765.23 KB) Additional Information: full citation, abstract, references, index terms

The Internet has led to the formation of a global information infrastructure. To explore a 
web site, a site map would be useful as a short cut for a user to locate for the target 
information in a structured and efficient manner, rather than drilling into the web site 
following hyperlinks, reading possibly irrelevant information. Useless information impacts a 
mobile web environment, where mobile clients are only connected with unreliable wireless 
channels of limited bandwidth. Structured web page ... 

Keywords: DOM, VRML, XML, visualization, web document structure 
April 1996 Proceedings of the SIGCHI conference on Human factors in computing
Publisher: ACM Press

Full text available: pdf(1.26 MB) Additional Information: full citation, references, citings, index terms



Keywords: World Wide Web, hypertext, information visualization 
June 2007 Proceedings of the 5th international conference on Mobile systems,
applications and services MobiSys '07
Publisher: ACM Press

Full text available: pdf(726.86 KB) Additional Information: full citation, abstract, references, index terms

Except for a handful of "mobile" Web sites, the Web is designed for browsing using 
personal computers with large screens capable of fully rendering the content of most Web 
pages. Browsing with handhelds, such as small-screen PDA's or cell phones, usually
involves a lot of horizontal and vertical scrolling, thus making Web browsing time-
consuming and strenuous. At the same time, one is often only interested in a fragment of a
Web page, which again may not fit on the limited-size screens of mobi ... 

Keywords: CMo, PDA, content adaptation, context-directed browsing, mobile browsing, 
partitioning, semantic blocks 



Smarter browsing: Csurf: a context-driven non-visual web-browser 
Jalal U. Mahmud, Yevgen Borodin, I. V. Ramakrishnan 

May 2007 Proceedings of the 16th international conference on World Wide Web 
WWW '07 

Publisher: ACM Press 

Full text available: pdf(534.06 KB) Additional Information: full citation, abstract, references, index terms

Web sites are designed for graphical mode of interaction. Sighted users can "cut to the
chase" and quickly identify relevant information in Web pages. On the contrary, individuals
with visual disabilities have to use screen-readers to browse the Web. As screen-readers
process pages sequentially and read through everything, Web browsing can become
strenuous and time-consuming. Although the use of shortcuts and searching offers some
improvements, the problem still remains. In this paper, we addr ...

Keywords: CSurf, Hearsay, context, non-visual, partitioning, screen-reader, semantic 
blocks, voice browser, web accessibility 
April 2004 Proceedings of the 42nd annual Southeast regional conference ACM-SE 42
Publisher: ACM Press

Full text available: pdf(429.27 KB) Additional Information: full citation, abstract, references, citings, index
terms

SA_MetaMatch, a component of the Standards Advisor (SA), is designed to find relevant 
documents through matching indices of metadata and document content. The elements in 
the metadata schema are mainly adopted from the Dublin Core (DC). The implementation 
of the XML metadata schema and coding follows the DC recommended guidelines. After 
metadata is generated manually for an unstructured document, or is extracted 
automatically from documents of well defined layout, they are stored in metadata fil ... 

Keywords: Dublin Core, document matching, index, metadata 



Constructing multi-granular and topic-focused web site maps 

Wen-Syan Li, Necip Fazil Ayan, Okan Kolak, Quoc Vu, Hajime Takano, Hisashi Shimamura 
April 2001 Proceedings of the 10th international conference on World Wide Web 

WWW '01 
Publisher: ACM Press 

Full text available: pdf(3.15 MB) Additional Information: full citation, references, citings, index terms



Keywords: decision tree algorithm, logical domain, multi-granularity, site map, topic 
distillation 
Alexander P. Pons

January 2006 ACM SIGMIS Database, volume 37 issue 1

Publisher: ACM Press

Full text available: pdf(1.31 MB) Additional Information: full citation, abstract, references, index terms

To date the most common means of gaining access to the Internet continues to be via 
dial-up modem connections. These slow communication channels significantly affect the 
rendering of the majority of web pages. Higher speed communications channels can 
alleviate rendering latency but based on the web page's content, delays still are incurred.
The technique of web object prefetching can expedite the presentation of web pages by
utilizing the current web page's view time to acquire the web objects o ...

Keywords: semantic links, web-application, web-prefetching 
Natalie Glance, Matthew Hurst, Kamal Nigam, Matthew Siegler, Robert Stockton, Takashi

August 2005 Proceeding of the eleventh ACM SIGKDD international conference on
Knowledge discovery in data mining KDD '05
Publisher: ACM Press

Full text available: pdf(629.35 KB) Additional Information: full citation, abstract, references, citings, index
terms

Weblogs and message boards provide online forums for discussion that record the voice of 
the public. Woven into this mass of discussion is a wide range of opinion and commentary 
about consumer products. This presents an opportunity for companies to understand and 
respond to the consumer by analyzing this unsolicited feedback. Given the volume, format 
and content of the data, the appropriate approach to understand this data is to use large- 
scale web and text data mining technologies. This paper ar ...

Keywords: computational linguistics, content systems, information retrieval, machine 
learning, text mining 
Gui-Rong Xue, Hua-Jun Zeng, Zheng Chen, Yong Yu, Wei-Ying Ma, WenSi Xi, WeiGuo Fan

November 2004 Proceedings of the thirteenth ACM international conference on
Information and knowledge management CIKM '04

Publisher: ACM Press

Full text available: pdf Additional Information: full citation, abstract, references, citings, index terms

The performance of web search engines may often deteriorate due to the diversity and 
noisy information contained within web pages. User click-through data can be used to 
introduce more accurate description (metadata) for web pages, and to improve the search 
performance. However, noise and incompleteness, sparseness, and the volatility of web 
pages and queries are three major challenges for research work on user click-through log 
mining. In this paper, we propose a novel iterative reinforced a ... 

Keywords: click-through data, iterative algorithm, log mining, search engine 
What's there and what's not?: focused crawling for missing documents in digital
libraries 

Ziming Zhuang, Rohit Wagle, C. Lee Giles 

June 2005 Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries 
JCDL '05 

Publisher: ACM Press 

Full text available: pdf(346.33 KB) Additional Information: full citation, abstract, references, index terms

Some large scale topical digital libraries, such as CiteSeer, harvest online academic 
documents by crawling open-access archives, university and author homepages, and 
authors' self-submissions. While these approaches have so far built reasonable size 
libraries, they can suffer from having only a portion of the documents from specific 
publishing venues. We propose to use alternative online resources and techniques that 
maximally exploit other resources to build the complete document collection of ... 

Keywords: ACM, CiteSeer, DBLP, digital libraries, focused crawler, harvesting 




Xiaoni Zhang, Kellie B. Keeling, Robert J. Pavur 
December 2000 Proceedings of the twenty first international conference on 
Information systems ICIS '00

Publisher: Association for Information Systems 

Full text available: pdf(148.65 KB) Additional Information: full citation, references, citings, index terms



Keywords: cluster analysis, exploratory study, information attributes, information 
presentation, measures 



32 Web crawling and exploration: A two-phase sampling technique for information
extraction from hidden web databases
Y. L. Hedley, M. Younas, A. James, M. Sanderson

November 2004 Proceedings of the 6th annual ACM international workshop on Web 
information and data management WIDM '04 

Publisher: ACM Press 

Full text available: pdf(563.56 KB) Additional Information: full citation, abstract, references, index terms

Hidden Web databases maintain a collection of specialised documents, which are 
dynamically generated in response to users' queries. However, the documents are 
generated by Web page templates, which contain information that is irrelevant to queries. 
This paper presents a Two-Phase Sampling (2PS) technique that detects templates and
extracts query-related information from the sampled documents of a database. In the first 
phase, 2PS queries databases with terms contained in their search interfac ... 

Keywords: document sampling, hidden web databases, information extraction 



33 Multimedia information ... imperfect speech
Dulce Ponceleon, Savitha Srinivasan

October 2001 Proceedings of the tenth international conference on Information and 
knowledge management CIKM '01 

Publisher: ACM Press 
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This paper addresses the problem of automatic detection of salient video segments for 
real-world applications such as corporate training based on associated speech 
transcriptions. We present a novel segmentation algorithm based on automatic speech 
recognition (ASR) applied to the audio track of the video. Our feature set consists of word 
n-grams extracted from the imperfect speech transcriptions. We use a two-pass algorithm 
that combines a boundary-based method with a content-based method. In th ... 

34 Assistive technologies ... visually im ...
Simon Harper, Neha Patel

October 2005 Proceedings of the 7th international ACM SIGACCESS conference on
Computers and accessibility Assets '05

Publisher: ACM Press

Full text available: pdf(2.19 MB) Additional Information: full citation, abstract, references, index terms

Anecdotal evidence suggests that Web document summaries provide the sighted reader 
with a basis for making decisions regarding the route to take within non-linear text; and 
additional research shows that sighted people use 'Gist' summaries as decision points to 
bolster their browsing behaviour. Other studies have found that visually impaired users are 
hindered in their cognition of the content of Web-pages because users must wait for an 
entire Web-page to be read before deciding on its usefulne ...

Keywords: document engineering, tools, visual impairment, web 
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William M. Detmer, Edward H. Shortliffe 
August 1997 Communications of the ACM, Volume 40 Issue 8 
Publisher: ACM Press 

Full text available: pdf(576.05 KB) Additional Information: full citation, references, citings, index terms
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i& navigation 

Jill Freyne, Rosta Farzan, Peter Brusilovsky, Barry Smyth, Maurice Coyle

January 2007 Proceedings of the 12th international conference on Intelligent user
interfaces IUI '07

Publisher: ACM Press

Full text available: pdf(1.14 MB) Additional Information: full citation, abstract, references, index terms

The goal of this paper is to detail the integration of two "social Web" technologies - social 
search and social navigation - and to highlight the benefits of such integration on two 
levels. Firstly, both technologies harvest and harness "community wisdom" and in an 
integrated system each of the search and navigation components can benefit from the 
additional community wisdom gathered by the other when assisting users to locate 
relevant information. Secondly, by integrating search and browsing we ... 

Keywords: community-based adaptation, hypermedia, social navigation, social search,
user interfaces 
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37 Paper session IR-10 (information retrieval): query expansion: Query expansion using ||§ 
& random 

Kevyn Collins-Thompson, Jamie Callan

October 2005 Proceedings of the 14th ACM international conference on Information 
and knowledge management CIKM '05 

Publisher: ACM Press 

Full text available: pdf(104.63 KB) Additional Information: full citation, abstract, references, citings, index terms

It has long been recognized that capturing term relationships is an important aspect of 
information retrieval. Even with large amounts of data, we usually only have significant 
evidence for a fraction of all potential term pairs. It is therefore important to consider 
whether multiple sources of evidence may be combined to predict term relations more 
accurately. This is particularly important when trying to predict the probability of relevance 
of a set of terms given a query, which may involve bo ... 

Keywords: pseudo-relevance feedback, query expansion, random walk, semi-supervised 
learning 
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November 2005 Proceedings of the 2005 ACM workshop on Information retrieval in 

peer-to-peer networks P2PIR '05 
Publisher: ACM Press 

Full text available: pdf(531.89 KB) Additional Information: full citation, abstract, references, citings, index terms
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Can we improve P2P search by looking into our social network? In this paper, we argue 
that P2P networks built upon specific communities (e.g., scientific social networks) could 
achieve such a goal, by providing an implicit personalization to the output results set. 
Existing work in social networks investigating co-authorship relations has shown that 
scientific collaboration networks are scale-free. At the same time, P2P systems based on 
synthesized small-world networks have emerged, with a posit ... 

Keywords: peer-to-peer networks, query forwarding strategies, scientific collaboration 
network analysis 
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January 1998 Proceedings of the 3rd international conference on Intelligent user 
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Publisher: ACM Press 

Full text available: pdf(109 MB) Additional Information: full citation, references, citings, index terms
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Most News Web sites provide display ads and contextual ads, which are the main ad 
formats in Internet. Unlike display ads, contextual ad is a performance based 
advertisement and allows clients to bid for their exposure rate. Not only do Web sites save 
operation costs but also clients plan their budget flexibly. However, in addition to accuracy 
of search results, contextual ads like a common resource pool will spend much more time 
to take the same number of clicks as display ads do. In this pa ... 

Keywords: competition, contextual ads, display ads, game theory, pricing strategies 



Results 21 - 40 of 40 Result page: previous 1 2 3 

The ACM Portal is published by the Association for Computing Machinery. Copyright © 2007 ACM, Inc. 



http://portal.acm.org/resul... 10/16/07



Results (page 3): +relevancy +page +words +keyword +rank +hyperlink 




Terms used: relevancy page words keyword rank hyperlink 




Found 40 of 212,795





http://portal.acm.org/results.cfm?query=%2Brelevancy%20%2Bpage%20%2Bw... 10/16/07



IEEE Xplore Search Result






Results for "( ( relevancy<in>metadata ) <and> ( rank<in>metadata ) ) <and> ( hyperlinked..."

Your search matched 0 documents. 

A maximum of 100 results are displayed, 25 to a page, sorted by Relevance in Descending order. 




No results were found. 




http://ieeexplore.ieee.org/search/searchresult.jsp?query1=relevancy&scope1=... 10/16/07



(USPTO) IP.com Prior Art Database 






October 16, 2007






Displaying records #1 through 10 out of 28 



Result # 1

What Can Be Automated?: The Computer Science and Engineering Res
Study (COSERS)

1980-01-01 IPCOM000128748D

It is truly difficult to capture with a single question the essence of research in a diverse 
active area of science and technology, but the query in the title comes very close. This 
first posed by the late Professor George Forsythe of Stanford ... 

Result # 2

The ENIAC: First General-Purpose Electronic Computer

1981-10-01 IPCOM000129377D

[Figure containing following caption omitted: Note. Arthur Burks's research and writing
have been supported by NSF Grant No. MCS 78-26016. Alice Burks has contributed sut
research of the historical documents, to analysis and organization of the ... 



Result # 3

Targeted Search Refinement Tool (TSRT)

2007-10-10 IPCOM000159154D

A search tool that allows the user to target their search by selecting associated words s 
the tool or providing additional words to further refine relevant results is described hen 
has the option to select the realm in which to perform the search. ... 



Result # 4

A UNIQUE WORD-SCANNING FACILITY FOR THE EUREKA FULL-TEXT
INFORMATION RETRIEVAL SYSTEM

1978-01-31 IPCOM000151422D

by William Ming-Cheong Leung January 1978 Department of Computer Science Univers 
Urbana-Champaign Urbana, Illinois 61801 This work was supported in part by the Natic 
Foundation under Grant No. US NSF MCS73-07980 and was submitted ... 



Result # 5

S.A. Lebedev and the Birth of Soviet Computing

1994-04-30 IPCOM000129792D

In this article, we study the life and work of Sergei Alekseevich Lebedev, one of the wo 
in digital computing. Lebedev, working in Kiev, built the MESM, the first Soviet electron 
stored-program computer (1947-1951). In 1950, Lebedev moved to Moscow, ... 



Result # 6

RELATIONAL DATABASE SYSTEM IMPLEMENTATIONS - A SURVEY

1978-01-31 IPCOM000151421D

Report No. UIUCDCS-R-78-913 RELATIONAL DATABASE SYSTEM IMPLEMENTATIONS A
Won Kim January 1978 Department of Computer Science University of Illinois at Urbar
Urbana, Illinois 61801 This work was supported in part by the National ...



Result # 7

In-Process Manufacturing Quality Control

1971-01-31 IPCOM000149024D



https://priorart.ip.com/searc... 10/16/07






Donald E. Lewin January 1971 PROJECT MAC MASSACHUSETTS INSTITUTE OF TECHNO 
Cambridge Massachusetts 02139 IN-PROCESS MANUFACTURING QUALITY CONTROL*
thesis develops a methodology for designing plans for the allocation of in-process ... 



This article describes the computing problems in the field of tidal calculations in the Net 
between 1920 and 1960. These calculations were necessary to predict the changes in tl 
movements caused by hydraulic works like the Zuiderzee Works and the Delta Works. . 



Result # 9

Network Access to Multimedia Information (RFC1614)

1994-05-01 IPCOM000002450D

This report summarises the requirements of research and academic network users for r 
to multimedia information. It does this by investigating some of the projects planned oi 
underway in the community. Existing information systems such as Gopher, WAIS ... 



Result # 10

Internet Security Glossary, Version 2 (RFC4949)

2007-08-01 IPCOM000157212D

This Glossary provides definitions, abbreviations, and explanations of terminology for ir 
system security. The 334 pages of entries offer recommendations to improve the comp 
written material that is generated in the Internet Standards Process (RFC ... 



Displaying page 1 of 3



Search query: ((relevant and pages) and ranking) and words 



Result # 8

Tidal Calculations in The Netherlands, 1920-1960

1992-09-30 IPCOM000129753D



https://priorart.ip.com/search... 10/16/07






Displaying records #11 through 20 out of 28 

Result # 11

CAPACITY PLANNING AND PERFORMANCE MANAGEMENT METHODOLC

1982-08-31 IPCOM000148602D

Washington Systems Center Gaithersburg, Maryland Technical Bulletin CAPACITY F
PERFORMANCE MANAGEMENT METHODOLOGY by Richard M. Armstrong 6622-9288-00
The information contained in this document has not been submitted to any ...

Result # 12

Test and evaluation community network (TECNET) (USH0002111)

2004-11-02 IPCOM000131008D



Test and Evaluation Community Network (TECNET) is a set of software and safeguards 
on a standard UNIX based computer to provide the following capabilities in a secure fas 
World Wide Web operating over the Internet: Simple Mail ... 



Result # 13

A Review of Knowledge Based Problem Solving As a Basis for a Genet
Experiment Designing System

1977-03-31 IPCOM000150623D

READ INSTRUCTIONS BEFORE COMPLETING FORM 3. RECIPIENTS CATALOG NUMBER 5.
REPORT PERIOD COVERED SECURITY CLASSIFICATION OF THIS PAGE (When Data Ent
REPORT DOCUMENTATION PAGE 1. REPORT NUMBER 2. GOVT ACCESSION NO. STAN-C
TITLE ...



Result # 14

Requirements for an Experimental Programming Environment

1980-12-31 IPCOM000128911D

Computer software costs rise steadily, as we expand our ambitions to include ever mor 
systems using ever cheaper hardware. The software that we can produce, and the rate 
can produce it, are too often limiting factors in our research within the Xerox Palo ... 



Result # 15

Site Security Handbook (RFC2196)

1997-09-01 IPCOM000002754D



This handbook is a guide to developing computer security policies and procedures for si 
systems on the Internet. The purpose of this handbook is to provide practical guidance 
administrators trying to secure their information and services. The subjects ... 



Result # 16

Site Security Handbook (RFC1244)

1991-07-01 IPCOM000002060D

1.1 Purpose of this Work



Result # 17

Domain-Based Application Service Location Using SRV RRs and the D\
Delegation Discovery Service (DDDS) (RFC3958)

2005-01-01 IPCOM000033945D



https://priorart.ip.com/searc... 10/16/07






This memo defines a generalized mechanism for application service naming that allows 
location without relying on rigid domain naming conventions (so-called name hacks). Tl 
defines a Dynamic Delegation Discovery System (DDDS) Application to map domain na 



Result # 18

Memory Management in the Programming Language ICL

1983-12-31 IPCOM000127931D

This paper presents the issues involved in implementing the programming language ICI 
the details of the implementation, with special emphasis on aspects of the data managt 
While the structures and algorithms presented here apply to all ... 



Result # 19

Applications-oriented AI Research: Medicine

1979-08-31 IPCOM000150643D

UNCLASSIFIED SECURITY CLASSIFICATION OF THIS PAGE (When Data Entered) REI
DOCUMENTATION PAGE READ INSTRUCTIONS BEFORE COMPLETING FORM 3. RECIPII
CATALOG NUMBER 1. REPORT NUMBER 2. GOVT ACCESSION NO I Applications-oriente
Research: ...



Result # 20

Dynamic Storage Allocation Techniques

1976-04-30 IPCOM000147961D

Charles Burr Weinstock April 1976 Department of Computer Science Carnegie-Mellon
University Pittsburgh, Pennsylvania 15213 Submitted to Carnegie-Mellon University in
fulfillment of the requirements for the degree of Doctor of Philosophy, This. ...



Displaying page 2 of 3



Search query: ((relevant and pages) and ranking) and words 



https://priorart.ip.com/search.jsp?searchType=dtFreeTextSearch&scache=17227669&query... 10/16/07 






Displaying records #21 through 28 out of 28 



Result # 21

Context and Goals for Common Name Resolution (RFC2972)

2000-10-01 IPCOM000005165D



This document establishes the context and goals for a Common Name Resolution Protoi 
the terminology used concerning a "Common Name" and how one might be "resolved", 
establishes the distinction between "resolution" and more ... 



Result # 22

REASONING ABOUT RESOURCE GOALS: A THESIS PROPOSAL

1988-11-30 IPCOM000150454D

November, 1988 REASONING ABOUT RESOURCE GOALS: A THESIS PROPOSAL C.W. L
Project Working Paper No. 96 Department of Computer Science Hill Center for the Math 
Sciences Busch Campus, Rutgers University New Brunswick, New Jersey 08903 The ... 



Result # 23

Version Spaces: An Approach to Concept Learning 
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CHARACTERIZATION OF A DISTRIBUTED DATA BASE SYSTEM 
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BY ENRIQUE GRAPA Department of Computer Science University of Illinois Urbana, III in 
work was supported in part by the Department of Computer Science, the Center for Ad 1 
Computation and the Command and Control Technical Center and was submitted ...
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PR-LEAP: A LEARNING APPRENTICE APPROACH TO ACQUIRING PREFK 
RULES FOR A VLSI DESIGN PROBLEM SOLVER 

1987-08-31 IPCOM000150397D

Working Paper 1. Introduction An active area of research in the 1980's has been the au 
the design process [6,9]. Before design automation can be realized, it is necessary to h 
understanding of a design model (201 appropriate to the target domain. One ... 
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How to Advertise Responsibly Using E-Mail and Newsgroups or - how 
$$$$$ MAKE ENEMIES FAST! $$$$$ (RFC3098) 
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This memo offers useful suggestions for responsible advertising techniques that can be 
internet in an environment where the advertiser, recipients, and the Internet Communi 
in a productive and mutually respectful fashion. Some measure of ... 
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An Annotated Bibliography on the Construction of Compilers 
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An Annotated Bibliogmphy on the Construction of Compilers 191 'BaryW. Pollack Compi 
Department Stanford University This bibliography is divided into 9 sections: General In! 
Compiling Techniques Syntax- and Bs se -Directed Parsing Parsing ... 
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Stencils and Problem Partitionings: Their Influence on the Performan< 
Multiple Processor Systems 

1899-12-30 IPCOM000149170D

Stencils and Problem Partitionings: Their Influence on the Performance of Multiple Proci 
Systems Daniel A. Reed Department of Computer Science University of Illinois Urbanc
61801 Loyce M. Adams Department of Applied Mathematics ...
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Google Webmaster Tools . Sponsored Links 

[pdf] Hierarchy in Web Page Similarity Link Analysis

File Format: PDF/Adobe Acrobat - View as HTML

Features to be used as part of a similarity query either can be intrinsic to
the document itself, or extrinsic: how the document relates to the rest of
the ...

wiki.commerce.net/images/9/9c/CN-TR-06-02.pdf - Similar pages

[pdf] Improving Authoritative Sources in a Hyperlinked Environment via 

File Format: PDF/Adobe Acrobat 

A collection of hyperlinked pages is viewed as a directed graph Quantitatively, this 

"extrinsic" measurement focuses on the relative directions of... 

www.cs.princeton.edu/~jherbach/hits-sw.pdf - Similar pages

[pdf] EntityRank: Searching Entities Directly and Holistically

File Format: PDF/Adobe Acrobat - View as HTML 

achieve the central mission of ranking relevant pages high ument collection- what 

intrinsic structure or extrinsic meta data is ... 

www-forward.cs.uiuc.edu/pubs/2007/entityrank-vldb07-cyc-jul07.pdf - Similar pages
Database access system - Patent 7181438 

Typically, a user interface, such as a browser, is utilized to access hyperlinked information 
resources. The user interface displays information "pages" or ... 

www.freepatentsonline.com/7181438.html - 250k - Cached - Similar pages

Method and system for probabilistically quantifying and ... 

[0072] Extrinsic patent metrics generally measure or report qualities or words can be 

used to provide improved keyword searching and/or further ... 

www.freepatentsonline.com/20070073748.html - 174k - Cached - Similar pages
[ More results from www.freepatentsonline.com ]

Knowledge portals and the emerging digital knowledge workplace 

For some operations, the distinction between intrinsic and extrinsic features is .... For 
example, in the IBM Text Analyzer system, the features are words; ... 

www.research.ibm.com/journal/sj/404/mack.html - 135k - Cached - Similar pages

Knowledge portals and the emerging digital knowledge workplace 

For some operations, the distinction between intrinsic and extrinsic results related to a 

topic are organized in hyperlinked briefs for reuse in ... 

www.encyclopedia.com/doc/1G1-82373863.html - 322k - Cached - Similar pages

Computer graphic display visualization system and method - US ...

The automated classifications may also provide extrinsic distinctions, e.g., commercially 
motivated distinctions, rather than purely intrinsic content ... 
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ippfi Summary in Context: Searching Versus Browsing 
File Format: PDF/Adobe Acrobat - View as HTML 

the query. Keyword-in-context (KWIC) or snippet summaries are query-based summaries
that show which query terms appear in a document and the words ... 

ai.arizona.edu/go/intranet/papers/ACM_1_2006.pdf - Similar pages

[pdf] Summary in Context: Searching Versus Browsing

evaluation of the generic AZ Summarizer using an intrinsic study. The second, involved the 
extrinsic evaluation of four different types of summaries using ... 

portal.acm.org/ft_gateway.cfm?id=1125861&type=pdf - Similar pages
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rppn On Recommending Jonathan Furner 
File Format: PDF/Adobe Acrobat - View as HTML 

Many recommender systems currently incorporate mechanisms for ranking documents 

extrinsic— in contrast with those intrinsic criteria that are ... 

polaris.gseis.ucla.edu/jfurner/jasist02.pdf - Similar pages

[pdf] Scholarly Communication and Bibliometrics Christine L Borgman and ...
File Format: PDF/Adobe Acrobat - View as HTML 

engaged in the creation of original scholarly works — in other words, authors (1998) 

describe their PageRank formula for ranking individual Web pages ... 

polaris.gseis.ucla.edu/jfurner/arist02.pdf - Similar pages

Joho the Blog: taxonomy Archives 

PLoS has an "intrinsic tension" she says because most of the people who Google 

knows the words on the pages, but doesn't know what the pages are ... 

www.hyperorg.com/blogger/mtarchive/taxonomy/ - 661k - Cached - Similar pages

All words legal in OWL2 (incl. expurgated ones)

All words legal in OWL2 (incl. expurgated ones) as of March 1, 2006 EXTREMISM 

EXTREMIST EXTREMITY EXTRICATE EXTRINSIC EXTROVERT EXTRUDERS 
EXTRUDING ... 

www-personal.umich.edu/~nicholsa/scr/owl2.html - Similar pages
words txt 

... extremes 9900 extremities 9901 extremum 9902 extrinsic 9903 extrusion 9904 ey 

hyperkahler 12812 hyperlink 12813 hyperlinked 12814 hyperlinks 12815 ... 

www.cis.hut.fi/Opinnot/T-61.5060/2007/words.txt - 425k - Cached - Similar pages

[pdf] 8 Autonomous Authoring Tools for Hypertext

—The Intrinsic value of the terms in c itself. —The extrinsic value of the cards 

immediately pages using their hyperlinked neighbouring pages. ... 

portal.acm.org/ft_gateway.cfm?id=1267072&type=pdf&coll=GUIDE&dl=&CFID=75683&CFTOKEN=4... - Similar pages
[pdf] On recommending 

objects in the form of a single ranking for the individual relevance, relatedness, quality- 
intrinsic or extrinsic. Evidence [7] ... 

dx.doi.org/10.1002/asi.10080 - Similar pages

[pdf] Cognitively Flexible Hypertext in an Object-Oriented Programming ...
File Format: PDF/Adobe Acrobat - View as HTML 

For the purposes of this study intrinsic motivation is defined as the mean score on. items 
1-4 of the motivation survey and, extrinsic motivation is defined ... 

krex.k-state.edu/dspace/bitstream/2097/124/3/CecilSchmidt2005.pdf - Similar pages

[pdf] PL Tutorial - Advanced
File Format: PDF/Adobe Acrobat 

words BackRub stores which pages every web page links to access considerations 

are extrinsic data. The focus on intrinsic data in no way demeans the ... 

ei.cs.vt.edu/~dlib/adv200006.pdf - Similar pages







http://www.google.com/search?... 10/16/07



+ranking +hyperlinked +words +keyword +score +pages +extrinsic +intrinsic - Google S... Page 2 of 2 



[pdf] Scholarly communication and bibliometrics

mula for ranking individual Web pages on the basis not only of (a) the Effective use of 

commercial databases of hyperlinked journal articles ... 

doi.wiley.com/10.1002/aris.1440360102 - Similar pages
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[pdf] Argumentative Zoning: Information Extraction from Scientific Text 
File Format: PDF/Adobe Acrobat 

Title Words (Cont-2): Cont-2 draws its definition of what a good keyword is from 

Intrinsic Evaluation. 221. Condition. Score. Start. Score = 0 ... 

www.cl.cam.ac.uk/~sht25/thesis/t.pdf - Similar pages

Achieving self-organisation in network-based learning environments 
The word motivation in this context is clearly referring to extrinsic rather than intrinsic 
stimuli, for it is only in a failure to achieve such goals that ... 

www.it.bton.ac.uk/staff/... - 634k - Cached - Similar pages

Technology & Marketing Law Blog: Marketing Archives 

However, the Wikipedia model provides little extrinsic benefits for In other words, if 

unwanted marketing arrives in the consumer's inbox, ... 

blog.ericgoldman.org/archives/marketing/ - 440k - Cached - Similar pages

[pdf] 7 Systems

ranking of knowledge elements: search results are presented in an order which 

individual information objects and can be intrinsic or extrinsic. ... 

www.springerlink.com/index/wS01 p464867937x0.pdf - Similar pages 

[pdf] ATHABASCA UNIVERSITY WEB DESIGN FOR EFFECTIVE ONLINE TRAINING AND ...
File Format: PDF/Adobe Acrobat

bookmark pages despite the use of frames. Score: Medium. 2. Gratuitous Use of Bleeding- 
Edge motivations primarily intrinsic or extrinsic influences? ... 

auspace.athabascau.ca:8080/dspace/bitstream/2149/555/1/Patsula.pdf - Similar pages

[pdf] Proceedings of the... 
File Format: PDF/Adobe Acrobat 

Proceedings of the ACL 2007 Demo and Poster Sessions, pages 1-4, dinary words, 

we only assign the score -1 to proper names. ... 
acl.ldc.upenn.edu/P/P07/P07-2.pdf - Similar pages

[pdf] Proceedings of the... 
File Format: PDF/Adobe Acrobat 

* dinary words, we only assign the score -1 to proper names tion at DUG and a 

suggestion for extrinsic evaluation. In ... 

acl.eldoc.ub.rug.nl/mirror/P/P07/P07-2.pdf - Similar pages 

[pdf] Executive Summary 
File Format: PDF/Adobe Acrobat 

individual intrinsic and extrinsic characteristics of the system are The Internet-based 

references at the end of each article should be hyperlinked ... 

www.ics.forth.gr/ftp/tech-repo...
orientation_inspection_European_eServices.pdf - Similar pages

[pdf] The Multi-Model, Metadata Driven Approach to Personalised ...

File Format: PDF/Adobe Acrobat 

the pages. APeLS uses Jakarta Tomcat to deliver the personalised course 
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https://www.cs.tcd .ie/Owen.Conian/ptJblications/Conlan Thesis.pdf - Similar pages 

[pdf] An Architecture for Collaborative Math and Science Digital Libraries
File Format: PDF/Adobe Acrobat

contains all intrinsic metadata and some extrinsic metadata (for example, an optional 

comment, and an optional hyperlinked list of context objects. ... 

br.endernet.org/~akrowne/my_papers/thesis/akrowne_thesis.pdf - Similar pages
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tppfi An Architecture for Collaborative Math and Science Digital Libraries 
File Format: PDF/Adobe Acrobat 

contain mathematics, as well as hyperlinked terms and phrases. These terms contains 

all intrinsic metadata and some extrinsic metadata (for example, ... 

physjob.nudi.org/-akrowne/thesis/akrowne ...thesis . 20030821 .pdf - Similar pages

[pdf] The Theory and Practice of Co-active Search 
File Format: PDF/Adobe Acrobat - View as HTML 

The intrinsic value of the terms in c itself. • The extrinsic value of the cards schemes 

for web pages using their hyperlinked neighbouring pages. ... 

www.cs.nott.ac.uk/Research/webtech/theses/truran-thesis.pdf - Similar pages

a A AA AAA Aachen aardvark aardvarks Aaren Aarhus Aarika Aaron ab ... 

... hyperinflation Hyperion hyperlink hyperlinked hyperlinking hyperlinks wordless 

wordlessly wordplay wordplays words Wordsworth wordy wore work ... 

www.timmerize.org/01 0906/words.txt - Similar pages

[pdf] Project ref. no. IST-1999-12611 CLASS Project title Collaboration ...
File Format: PDF/Adobe Acrobat - View as HTML 

an extrinsic evaluation where the results of the summarisa- words are rare, the score 

is fixed to 10 (there are 4 or less, content words in a query). ... 

www.limsi.fr/TLP/CLASS/ClassD43.pdf - Similar pages

[pdf] Ioannis Hatzilygeroudis (Editor)
File Format: PDF/Adobe Acrobat

simultaneously. Mathan [26] classify feedback by the source to intrinsic and extrinsic to a
task environment. According to Mathan [26] ... 

mmlab.ceid.upatras.gr/aigroup/ht05-wbies/C!AH-2005.pdf - Similar pages

[pdf] Information and knowledge management in support of legal research — 
File Format: PDF/Adobe Acrobat 

contract under which the individual gains intrinsic and extrinsic rewards sufficient to 

This has lead to what is known today as hyperlinked browsing. ... 
etd.rau.ac.za/theses/available/etd-11162004-094459/restricted/DuPlessisthesis.pdf - Similar pages

[pdf] Educational Technology & Society 
File Format: PDF/Adobe Acrobat 

leading to cognitive overload (intrinsic plus extrinsic cognitive load exceeds for 

evaluation used a book metaphor similar to a hyperlinked PDF file. ... 

www.ifets.info/journals/8_1/ets_8_1.pdf - Similar pages

UWTV: Programs - Social Sciences 

Dealing with Data: Classification, Clustering and Ranking principle types of data, 

representing the intrinsic and extrinsic properties of entities'. ... 

www.uwtv.org/programs/viewsubject.aspx?fid=569 - 977k - Cached - Similar pages

[pdf] The Acquisition and Representation of Word Meaning 
File Format: PDF/Adobe Acrobat 

Table 1 : Nearest neighbors for several English words as obtained by the Each page 
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rpon AUTOMATIC DOCUMENT-LEVEL SEMANTIC METADATA ANNOTATION USING ■„ 

File Format: PDF/Adobe Acrobat

of it (e.g. title of a resource); extrinsic sources, which are not contained within the 

hyperlinked set of documents, such as the World Wide Web, ... 

eprints.ecs.soton.ac.uk/14181/01/Hend_Thesis.pdf - Similar pages
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[rtf] TrendSiters Digital Content And Web Technologies 3rd EDITION Sam „■ 

File Format: Rich Text Format 

Free content, in other words, limits both the downside and the upside of the Exogenic- 

Extrinsic Meaning Cultures versus Endogenic-Intrinsic Meaning ...

www.narcissistic--abuse.com/digitalcontent.rtf - Similar pages 

Patent Prospector: Patent Hawk Archive 

But 8r5 is released in pdf format only; no hyperlinked web (html) version As in Phillips 

v. AWH, intrinsic evidence trumped extrinsic. ... 

www.patenthawk.com/blog/patenthawk.html - Similar pages

[pdf] T C D J P R

File Format: PDF/Adobe Acrobat 

deprived of any predetermined origin, creates an intrinsic In other words, did relevant 

documents, have a higher ranking in the re-ranked list? ... 
www.gsu.tcd.ie/JPR/2006-2007/full_jpr_2007.pdf - Similar pages

A Glossary of Publishing Terms 

Also, a word made by putting together parts of other words (eg: motel, guesstimate, 

has substituted intrinsic value with extrinsic commodity. ... 

www.contexture.us/PUBTERMS/PUBTERMS.HTM - 788k - Cached - Similar pages 

2007-10-11T09:16:59Z http://www.biomedcentral.com/oai/2.0/ oai ...

This means more than online access to articles, hyperlinked references and A further 

6 gene sets directly involve the extrinsic and intrinsic ... 

www.biomedcentral.com/oai/2.0?ve...
3A1 002 - Similar pages 

[pdf] WPS Single Modality Processing State of the Art Report
File Format: PDF/Adobe Acrobat

tion in a hyperlinked environment. In Proc. 21st ACM International Conference on
Research and Development in Information Retrieval. SIGIR-98, pages 104-111 ... 

www-rora^inria.fr/imedia^ 

bill kerr 

It is the Web's hyperlinked nature to pull our attention here and there to increase a 

score - this is a simpler solution than my conditional hint. ... 

billkerr.blogspot.com/ - 977k - Cached - Similar pages

Scrabble Dictionary ***»> SOWPODS Full List of over 160000 words. 

Full listing of acceptable words in the game of Scrabble. ... extrication extrications extrinsic 

extrinsically extrorse extroversion extroversions extrovert ... 

www.rcscrapyard.bizland.com/sow.htm - Similar pages

[pdf] I : P I

File Format: PDF/Adobe Acrobat

encoded protein sequences to those of other proteins represents the intrinsic, and 
extrinsic approaches to gene prediction [110], which ideally should be ... 

hydra.icgeb.trieste.it/-pongor/biophys-homepag 
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Interest in the dynamic nature of hyperlinked corpora has recently been sharing 

process, we explore the extrinsic and intrinsic incentives for ... 

www.socsci.uci.edu/~ssnconf/conf/SunbeltXXVProgram.pdf - Similar pages
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Preoccupations: Creativity 

Conversation is not an enterprise designed to yield an extrinsic profit he managed 150 

words in five minutes, 50 more than the best American score. ... 

www.preoccupations.org/creativity/Index.html - 977k - Cached - Similar pages
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File Formal: PDF/Adobe Acrobat 

age, in other words, a re-query). An additional API call will take a score structure and a
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95625 ABC ABMs ACLU ACTH AFAIK AFAIKs AFDC AIDS AIDS es ANSI ANSIs 

... extricates extricating extrication extrinsic extrinsically extroversion hyperinflation 

hyperlink hyperlinked hyperlinking hyperlinks hypermarket ... 

ipsc.ksp.sk/contests/ipsc2005/real/problems/f2.in - 876k - Cached - Similar pages

aa aah aahed aahing aahs aal aalii aaliis aals aardvark aardvarks ...

... extrication extrications extrinsic extrinsically extrorse extroversion hyperkinesis 

hyperkinetic hyperlink hyperlinked hyperlinking hyperlinks ... 

www.recmath.org/contest/enable2k.txt - Similar pages

Official North American Scrabble Tournament Word Lists 

See also a cheat sheet of useful sub-lists, e.g. words with only vowels, 

HYPERKINESIS HYPERKINETIC HYPERLINK HYPERLINKED HYPERLINKING 
HYPERLINKS ... 

www.kisa.ca/scrabble/ - Similar pages
Site lndex# 



http://www.google.com/search?q=%2Branking+%2Bhyperlinked+%2Bw 10/16/07



+ranking +hyperlinked +words +keyword +score +pages +extrinsic +intrinsic - Google S... Page 2 of 2



extraordinary 3 extrapolated 2 extrav 1 extravagana 4 extravaganza 108 extravanganza 1 
extreme 15 extremely 166 extremes 1 extremist 1 extrinsic 1 ... 

www.socalhoops.tierranet.com/cgi-bin/search_form.cgi?BUILDINDEX=y - Similar pages






http://www.google.com/search?q=%2 10/16/07



+ranking +hyperlinked +words +keyword +score +pages +extrinsic +intrinsic - Google S... Page 1 of 1
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317502 the 184884 of 168196 and 140673 to 138851 a 112630 in 70155 ... 

... hyzod 7 hyphen 7 hyperopia 7 hyperlinked 7 hygienic 7 hyggem 7 hydroseal 5 

extrusions 5 extrinsic 5 extremen 5 extrait 5 extinguished 5 externer ... 
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In order to show you the most relevant results, we have omitted some entries very similar to 
the 61 already displayed. 

If you like, you can repeat the search with the omitted results included . 
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